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Abstract — This paper considers peer-to-peer scheduling for a 
networlt with multiple wireless devices. A subset of the devices 
are mobile users that desire specific files. Each user may already 
have certain popular flies in its cache. The remaining devices 
are access points that typically have access to a larger set of 
files. Users can download packets of their requested file from an 
access point or from a nearby user. Our prior work optimizes 
peer scheduling in a general setting, but the resulting delay can be 
large when applied to mobile networks. This paper focuses on the 
mobile case, and develops a new algorithm that reduces delay 
by opportunistically grabbing packets from current neighbors. 
However, it treats a simpler model where each user desires 
a single file with infinite length. An algorithm that provably 
optimizes throughput utility while incentivizing participation is 
developed for this case. The algorithm extends as a simple 
heuristic in more general cases with finite file sizes and random 
active and idle periods. 



I. Introduction 

Consider a network with N wireless devices. Let K of 
these devices be identified as users, where K < N. The 
users can send packets to each other via direct peer-to-peer 
transmissions. Each user has a certain collection of popular 
files in its cache. The remaining N — K devices are access 
points. The access points are connected to a larger network, 
such as the internet, and hence typically have access to a 
larger set of files. While a general network may have users 
that desire to upload packets to the access points, this paper 
focuses only on the user downloads. Thus, throughout this 
paper it is assumed that the access points only send packets 
to the users, but do not receive packets. In contrast, the users 
can both send and receive. The network is mobile, and so 
the transmission options between access points and users, and 
between user pairs, can change over time. 

Each user only wishes to download, and does not naturally 
want to send any data. Users will only send data to each 
other if they agree to operate according to a control algorithm 
that schedules such transmissions. This paper assumes the 
users have already agreed to abide by the control algorithm, 
and thus focuses attention on altruistic network design that 
optimizes a global network utility function. Nevertheless, this 
paper includes a tit-for-tat constraint, similar to the work in 
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[1], which is an effective mechanism for incentivizing partic- 
ipation. Further, while this paper develops a single algorithm 
for optimizing the entire network, this does not automatically 
require the algorithm to be centralized. Indeed, the resulting 
algorithm often has a distributed implementation. 

The model of this paper applies to a variety of practical 
network situations. For example, the access points can be 
wireless base stations in a future cellular network that allows 
both base-station-to-user transmissions as well as direct user- 
to-user transmissions. Alternatively, some of the access points 
can be smaller femtocell nodes. To increase the capacity of 
wireless systems, it is essential for future networks to enable 
such femtocell access and/or direct user-to-user transmission. 
This paper considers only 1-hop communication, so that all 
downloads are received either directly from an access point 
or from another user The possibility of a user acting as a 
multi-hop relay is not considered here. 

Our prior work [1] treats a more complex model where 
each user can actively download multiple files at the same 
time, and where the arrival process of desired files at each 
user is random. A key challenge in this case is the complexity 
explosion associated with labeling each file according to the 
subset of other devices that already have it. This is solved in 
[1] by first observing the subset information of each newly 
arriving file, making an immediate decision about which 
device in this subset should transmit the desired packets, and 
then placing this request in a request queue at that selected 
device. The devices are not required to transmit these packets 
immediately. Rather, they can satisfy the requests over time. 
This procedure does not sacrifice optimality, and yields an 
algorithm with polynomial complexity. However, while this 
is effective for networks with static topology, it can result in 
significant delays in mobile networks. That is because a device 
that is pre-selected for transmission may not currently be in 
close proximity to the intended user, and/or may move out of 
transmission range before transmission occurs. 

The current paper provides an alternative algorithm that 
reduces delay, particularly in the mobile case. To do so, we use 
a simpler model that assumes each user desires only one file 
that has infinite size. This enables us to focus on scheduling 
to achieve optimally fair download rates for each user Rather 
than pre-selecting devices for eventual transmission, our algo- 
rithm makes opportunistic packet transmission decisions from 
the set of current neighbors that have the desired file. This 
also facilitates distributed implementation. The infinite file size 
assumption is an approximation that is reasonable when file 
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sizes are large, such as for video files. A heuristic extension 
to the case of finite file sizes is treated in Section |V] This 
heuristic is based on the optimality insights obtained from the 
infinite size case. 

Prior work on fair scheduling in mobile ad-hoc networks 
has considered token-based and economics-based mechanisms 
to incentivize participation [2] [3] [4]. Incentives are also well 
studied in the peer-to-peer literature. For example, algorithms 
in [5] [6] [7] [8] track the number of uploads and downloads 
for each user, and give preferential treatment to those who 
have helped others. Algorithms based on tokens, markets, and 
peer reputations are considered in [9] [ 1 0] [ 1 1 ] . Such algorithms 
are conceptually based on the simple "tit-for-tat" or "treat-for- 
treat" principle, where rewards are given in direct proportion 
to the amount of self-sacrificial behavior at each user. Our 
current paper also considers a tit-for-tat mechanism. However, 
a key difference is that it designs a tit-for-tat constraint directly 
into the optimization problem (similar to our prior work [1] 
that used a different network model). Remarkably, the solution 
of the optimization naturally results in an intuitive token-like 
procedure, where the number of tokens in a virtual queue 
at each user determines the reputation of the user Peer-to- 
peer transmissions between user pairs are given preferential 
treatment according to the differential reputation between the 
users. This is similar to the backpressure principle for optimal 
network scheduling [12] [13]. However, the "backpressure" in 
the present paper is determined by token differentials in virtual 
reputation queues, rather than congestion differentials. 

II. Basic Model 

Let M represent the set of devices, and JC represent the set 
of users, where JC C J\f. Let N and K be the sizes of these 
sets. For simplicity in this basic model, assume that each user 
k G K. wants a single file that consists of an infinite number 
of fixed-length packets (this assumption is modified to treat 
finite file sizes in Section |V]i. For each k G IC, define 
as the subset of devices in J\f that have the file desired by 
user k. The set Tk can include both users and access points, 
and represents the set of devices that user k can potentially 
receive packets from as it moves throughout the network. If 
each user k only accepts downloads from a certain subset of 
devices that it identifies as its social group, then J^k can be 
viewed as the intersection of its social group and the set of 
devices that have its file. We assume that k ^ Tk, so that no 
user wants a file that it akeady has. Further, we assume J^k 
is non-empty, so that at least one device has the desired file. 
This latter assumption is reasonable if the network has one or 
more access points with high speed internet connections. Such 
access points can be modeled as having all desired files. 

The network operates in slotted time with time slots t E 
{0,1,2,...}. Every slot t, the network makes transmission 
actions. Let {fj.nk{t)) be the matrix of transmission actions 
chosen on slot t, where each entry fJ,nk{t) is the number of 
packets that device n E Af transmits to user k E K.. The set 
of all possible matrix options to choose from is determined 
by the current topology state of the network, as in [13]. 
Specifically, let represent the topology state on slot t. 



being a vector of parameters that affect transmission, such as 
current device locations and/or channel conditions. Assume 
ijj{t) takes values in an abstract set O, possibly being an 
infinite set. The ijj{t) process is assumed to be ergodic. In 
the case when Vl is finite or countably infinite, the steady 
state probabilities are represented by 7r(tj) — Pr[uj{t) — 
uj] for all u! E ft. Else, the steady state probabilities are 
represented by an appropriate probability density function. 
These probabilities are not necessarily known by the network 
controller Extensions to the case when uj{t) is non-ergodic 
are explored in Sections [V] and |VI] 

For each u E ft, define Tl{uj) as the set of all transmission 
matrices (fink) that are possible when uj{t) — uj. The exact 
structure of TZ{ll!) depends on the particular physical charac- 
teristics and interference properties of the network. One may 
choose the TZ{uj) sets to constrain the transmission variables 
Unk to take integer values, although this is not required in our 
analysis. We assume only that each set TZ{uj) has the following 
basic properties: 

> Every matrix (fink) G Tl{uj) has non-negative entries. 

• If (Mrifc) e TZ{uj), then (fink) e 7^(w), where (fink) is 
any matrix formed by setting one or more entries of (fink) 
to zero. 

« Every matrix {^nk) G must satisfy the constraint 

< EnGA^Mnfc < 2;'^"'''= for all k E IC, where x'^""^ 
is a given bound on the number of packets that can be 
delivered to user k on one slot, regardless of w. 

A. Example Network Structure 

This section presents an example network model that fits 
into the above framework. The network region is divided 
into C non-overlapping subcells. Let c„(t) be the current 
subcell of device n, so that c„(t) E {1, . . . , C}. Let c{t) = 
(ci(t), . . . , CAr(t)) be the vector of device locations on slot t. 
The access points have fixed locations, while the mobile users 
can change subcells from slot to slot. Let S{t) = {Snk{t)) be 
a channel state matrix, where Snk (t) is the number of packets 
that device n can transmit to device k on slot t, provided 
that there are no competing transmissions (as defined below). 
The value Snk{t) can depend on the c{t) location vector. 
Let the topology state be given by uj{t) — {c{t),S{t)). 
Assume access point transmissions are orthogonal from all 
other access point transmissions and from all peer-to-peer user 
transmissions. For each uj{t), define Tl{uj{t)) as the set of all 
(/Ltrifc(O) matrices with entries that satisfy: 

• Mnfe(i) e {0, Snk{t)} for all n e A/" and k E IC. 

m Users can only transmit to other users currently in their 
same subcell, so that fink{t) ~ whenever n E JC, k E 
JC, and Cn(t) ^ Ck{t). 

m At most one user-to-user transmission can take place per 
subcell on a given slot. 

• Each access point can send to at most one user per slot. 
This particular structure is useful because it allows user 
transmissions to be separately scheduled in each subcell, and 
access point transmissions to be scheduled separately from all 
other decisions. The sets Tl{uj{t)) can be defined differently 
for more sophisticated interference models. For example, the 
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transmission rate of an access point can depend on whether or 
not neighboring access points are scheduled for transmission. 

B. Optimization Objective 

For each a E J\f and b E K,, define fab{t) to be 1 if, on slot 
t, device a has the file requested by user b, and otherwise: 



fabityi 



if a e J'b 
otherwise 



(1) 



Because each user desires a single infinite size file, the fab{t) 
values do not change with time. However, we use the "(i)" 
notation because it facilitates the extension to more general 
cases in Section [V] where the sets Ti, in the right-hand-side of 
dl) are extended to J-b{t). Every slot t, the network controller 
observes Lu{t) and chooses (/i„fc(^)) G T^i'^it))- For each k e 
JC, define Xk{t) as the total number of packets that user k 
receives from others on slot t, and define yk{t) as the total 
number of packets that user k delivers to others on slot t: 



yk{t)^Y.beicl^kb{t)fkb{t) 



(2) 
(3) 



The multiplication fiab{t)fab{t) in Q and (O formally ensures 
that user b can only receive a packet from another device that 
has the file it is requesting. 

For a given control algorithm, let xk and Ijf. represent the 
time averages of the Xk{t) and yk{t) processes for all k E JC: 

Xk = limf^oo 7 Et=o ^k{T) , Vk linit^oo \ Et=o Vki'^) 

These limits are temporarily assumed to existQ The value 
Xk is the time average download rate of user fc, and y^. 
is the time average upload rate. The goal is to develop a 
control algorithm that solves the following stochastic network 
optimization problemjl 

Maximize: Y.keK^k{xk) (4) 

Subject to: akXk < Pk + Vk Vfc G /C (5) 

{^ink{t)) e n{Lo{t)) yt e {o, 1,2,.. .} (6) 

where for each k E JC, (t>k{x) are given concave functions 
and ak^Pk are given non-negative weights. The value 4>k{xk) 
represents the utility associated with user k downloading at 
rate Xk- The constraints (|5]l are the tit-for-tat constraints from 
[1]. These constraints incentivize participation. They allow a 
"free" download rate of jik/ak- Users can only receive rates 
beyond this value in proportion to the rate at which they help 
others. The value j3k can be set to zero to remove the "free" 
rate, and the value ak can be set to to remove the tit-for-tat 
constraint for user k. Choosing larger values of ak (typically 
in the range < < 1) leads to more stringent requirements 
about helping others. These tit-for-tat constraints restrict the 
system operation and thus can affect overall network utility. 
Removing these constraints by setting ak = for all k 
leads to the largest network utility, but does not embed any 
participation incentives into the optimization problem. 

' This is only to simplify exposition of the optimization goal. The analysis 
in later sections does not a-priori assume the limits exist. 
^Note that the trivial all-zero solution is always feasible. 



The functions 4>k{x) are assumed to be concave, continu- 
ous, and non-decreasing over the interval x > 0. They are 
not required to be differentiable. For example, they can be 
piecewise linear, such as (t>k{x) = min[x,6'fc], where 6k is a 
given constant rate desired by user k. Alternatively, one can 
choose 4)k{x) = ln(x) for each k E JC, which leads to the 
well known proportional fairness utility [14]. 

One may want to modify the problem dU-® by specifying 
separate utility functions for the user-to-user download rates 
and the access-point-to-user download rates. This is possible 
by creating two "virtual users" mi(fc) and m2(fc) for each 
actual user k E JC. Channel conditions for the virtual users 
mi (fc), 777-2 (fc) are defined to be the same as for the actual 
user k, with the exception that virtual user mi{k) is restricted 
to receive only from other users, while virtual user 7712 (fc) is 
restricted to receive only from the access points. 



III. The Dynamic Algorithm 

The problem dUl-® is solved via the stochastic network 
optimization theory of [13] [15]. First note that problem (|4]l- 
(|6]l is equivalent to the following problem that uses auxiliary 
variables 7fc(0- 

Maximize: Y^keK^kilk) (7) 

Subject to: ctkXk < fik+Vk Vfc e /C (8) 

7fc < xfc Vfc e /C (9) 

(AinfcW) e e {0, 1,2,.. .} (lO) 

0<7fcW<4""" Vte {0,1,2,...} (11) 

The auxiliary variables 7fc(t) act as proxies for the actual 
download variables Xk{t). This is useful for separating the 
nonlinear utility optimization from the network transmission 
decisions. 

It can be shown that the optimal utility value is the same 
for both problems dUi-© and ©-(dUl [15]. Let 0* denote 
this optimal utility. Now consider an algorithm that solves 
the problem (TTll-lfTTli. Let (/i„fc(i)), (7A;(i)) be the decisions 
made over time, and let 'xk, yk, Jk the corresponding time 
averages, all of which satisfy the constraints of the problem 
(iTli- dTTl l. Note that these constraints include all of the desired 
constraints of the original problem (IHl-®. Then: 



< Y.keic'l^kixk) 

< 6* 



(12) 
(13) 
(14) 



where ([T2]) holds because this algorithm achieves the optimal 
utility </>*, ( fTsl l holds because this algorithm must yield time 
averages that satisfy 7^, < Xk for all k £ JC, and ( fT4l l holds 
because the transmission decisions of the algorithm satisfy all 
desired constraints of the original problem, and thus produce 
Xk values that give a utility that is less than or equal to the 
optimal utility of the original problem (which is also (f>*). It 
follows that any algorithm that is optimal for (l7Ti-(fTTTi makes 
decisions that are also optimal for the original problem. 
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A. Virtual Queues 

To facilitate satisfaction of the tit-for-tat constraints dS), for 
each k £ K, define a virtual queue Hk{t), with dynamics: 

Hk{t + 1) = miix[Hk{t) + akXkit) ~ Pk ~ yk{t),0] (15) 

where Xk{t), yk{t) are defined in ©-(O. The intuition is 
that akXk{t) can be viewed as the "arrivals" on slot t, and 
Pk + Vk{t) can be viewed as the "offered service" on slot 
t. Stabilizing queue Hk{t) ensures the time average of the 
"arrivals" is less than or equal to the time average of the 
"service," which ensures constraints (|8]l. 

Similarly, to satisfy the constraints (|9]l, for each k E K, 
define another virtual queue Qk{t) with dynamics: 

Qk{t + l)= max[Qfc(t) +7fe(t) - Xkit),0] (16) 

The update ( fTSI l can be interpreted as a queueing equation 
where jk{t) is the amount of data requested by user k on 
slot t, and Xk{t) is the amount of service. Stabilizing Qk{t) 
ensures 7^. < xk- 



B. The Drift-Plus-Penalty Algorithm 

Define the following quadratic function L{t): 

mHEkeAQkitf + H,m 

Intuitively, taking actions to push L{t) down tends to maintain 
stability of all queues. Define A(i) as the drift on slot t: 

A{t)AL{t + 1) - L{t) 

Let @{t) = {Qk{t),Hk{t))\keK be the vector of all virtual 
queue values on slot t. The algorithm is designed to observe 
the queues and the current on each slot t, and to then 
choose {fJ.nk{t)) G TZ{uj{t)) and 7fe(t) subject to < 7fc(i) < 
j,iriax minimize a bound on the following drift-plus-penalty 
expression [15]: 

m-VEkeKMlk{t)) 

where is a non-negative weight that affects a performance 
bound. Intuitively, the value of V affects the extent to which 
our control action on slot t emphasizes utility optimization in 
comparison to drift minimization. 

Lemma 1: Under any control algorithm, we have: 

AW - l^EfceK Mlk{t)) < B{t) - l^EfceK<^fc(7fe W) 
+Y.keK Hk{t)[akXk{t) - l3k~ Vk{t)] 

+Y.keKQk{i)[lk{t) - Xk{t)] (17) 

where B{t) is defined: 

B(t) 4 \Y.keKi^kXk{t)~fik-Vk{t)? 

Proof: Squaring ( fTsT i and using max[j/,0]^ < for any 
real number y yields: 

Hk{t + lf < Hk{tf + (akXkit) ~ - yk{t)f 
2Hk{t){akXk{t) - Pk - yk{t)) 



Similarly, squaring ( fT6] l gives: 

Qk{t + lf < Qkit)^ + hkit)-xk{t))^ 
+2Qk{t)iikit) ~ Xk{t)) 

Summing over k G JC and dividing by 2 yields the result. □ 
The value of B{t) can be upper bounded by a finite constant 
B every slot, where B depends on the maximum possible 
values that ^J,nk{t) and jkit) can take. The algorithm below 
is defined by observing the queue states and uj{t) every slot 
t, and choosing actions to minimize the last three terms on 
the right-hand-side of (not including the first term B{t)), 
given these observed quantities. Specifically, every slot t: 

• (ikit) decisions) Each user fc G /C observes Qkit) and 
chooses 7fe(t) to solve: 

Maximize: V(j)k{ik{t)) - Qk{t)ik{t) 
Subject to: < 7fc(t) < x^"'^ 

• (Pnkit) decisions) The network controller observes all 
queues {Q{t),H{t)) and the topology state uj{t) on slot 
t, and chooses matrix G Tl{uj{t)) to maximize 
the following expression: 

J:ne^fJ:keK^^nk{t)fnk{t)Wnk{t) (18) 

where weights Wnk{t) are defined: 

Wnkit)^ [Qkit) + l{neK}Hn{t) " akHk{t)\ 

where l^^i^x.} ™ indicator function that is 1 if device 
n is a user, and if device n is an access point. 

• (Queue updates) Update virtual queues Hk{t) and Qk{t) 
for all fc e /C via dTSj and (fT6] l. 

The 7fe(t) decisions can be viewed as flow control actions 
that restrict the amount of data requested from user k on each 
slot. They are made separately at each user k. The {nnk{t)) 
decisions are transmission actions made at the network layer. 
Examples are given below. 

C. Example Flow Control Decisions 

Suppose we have the following piecewise linear utility 
functions for all fc e /C: 



(t^kixk) = Vkm.six[xk,9k] 



(19) 



where Vk are given positive values that act as priority weights 
for the users, and 0^ are are given positive values that represent 
the maximum desired communication rate for each user. 
Assume that Ok < x'^"'^ for all k G K.. Then the algorithm in 
the previous section chooses 7fe(t) for each user fc e /C as: 

9k if Qk{t)<Vvk 
otherwise 



lk{t) 



Alternatively, suppose we have the following strictly con- 
cave utility functions for all k £ JC: 



(pkixk) = ln(l + VkXk) 
Then the ^k{t) decisions are: 

' V I' '"^'^^ 



lk{t) 



kit) 



Vk 



(20) 



(21) 
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where the operation [j/]q is equal to i/ if < y < 6, if 
y < 0, and b if y > b. These utility functions can be viewed 
as an accurate approximation of the proportionally fair utility 
function if we use — v for all k, for a large value of v. 

Finally, using the proportionally fair utilities 4)x{xk) — 
\n{xk) for all fc e /C leads to 7fe(t) = [V /Qk{t)]'^''''\ which 
is indeed the same as (1211 1 in the limit as Vk ^ oo. 

D. Example Transmission Decisions 

Suppose the network has the special structure specified in 
Section III-AI Let A be the set of access points. The set A is 
disjoint from the user set /C, and AlJ JC = N. Let K.a{t) be 
the set of users within reach of access point a on slot t. Then 
each access point a ^ A observes channels Sak (t) and queues 
Qk{t), Hk{t) for all users k £ JCa{t) and chooses to serve 
the single user in JCa{t) with the largest non-negative value of 
fak{t)Sak{t)[Qk{t) - akHkit)] (breaking ties arbitrai'ily), and 
chooses no users if this value is negative for all k £ JCa{t). 

Further, the user pairs in each subcell c G {1,...,C} 
are observed. Amongst all users in a given subcell, the 
ordered user pair (a, k) with the largest non-negative value of 
fak{t)Sak{t)[Qk{t) + Ha{t) - akHk{t)] is selected for peer- 
to-peer transmission in that subcell (breaking ties arbitrarily). 
No peer-to-peer transmission occurs in the subcell if this value 
is negative for all user pairs. This can be coordinated by a 
controller in each subcell, or can be done by having each user 
first transmit control information from which the winning user 
pair is determined. 

IV. Algorithm Performance 

The algorithm fits into the stochastic network optimization 
framework of [15], and hence it satisfies all desired constraints 
of the original problem (|4|i-(|6ll (as well as the transformed 
problem (fTt-lfTTTi). with an overall utility value that differs 
from the optimal 0* by at most 0{1/V), which can be made 
arbitrarily small as the parameter V is increased. However, 
the V parameter directly affects the size of the virtual queues, 
which affects convergence time of the algorithm to this util- 
ity. Here we show that if the utility functions and network 
transmission sets have additional structure, the virtual queues 
Qk [t) and Hk [t) can be deterministically bounded for all time 
by a constant that is proportional to V . 

A. Bound on Data Queues Qk{t) 

Suppose each utility function 0^ {t) has right-derivatives that 
are bounded by a finite constant Vk > over the interval < 
Xk < x™"^. For example, this holds for the piecewise linear 
utility functions (fT9] l and the strictly concave utility functions 
(|20] |. with the h'k parameters specified there indeed being the 
maximum right derivatives. 

Lemma 2: If utility function (j)k{x) has maximum right 
derivative i^k > 0, then: 

< Qk{t) < Vuk + x^"" Vt e {0, 1,2,.. .} 

provided that this inequality holds for (5fe(0). 

Proof: Assume that Qk{t) < Vvk+x^"'^ for slot t (it holds 
by assumption on slot t — 0). We prove it also holds for slot 



i + 1. First consider the case Qk{t) < Vvk- From the queue 
update equation ( fT6l l. we see that this queue can increase by 
at most x™"^ on each slot, and so we have Qk{t + 1) < 
Vvk + x™"^, proving the result for this case. 

Now consider the case Qk{t) > Vvk- On slot t, the algo- 
rithm chooses 7fc(t) G [0,x™°^] to maximize the expression: 

VMlk{t))-Qk{t)ik{t) 

However, for any 7fe(i) > we have: 

V<i3k{ik{t))~Qk{t)-ik{t) 

< V(f>k (0) + Vi^klk (t) - Qk {thk {t) 

= vMo) + ik{t)[Vi^k-Qk{t)] 

< VMO) 

where equality holds if and only if 7^ = (recall that [Vi'k — 
Qk{t)] < 0). It follows that the algorithm must choose jk{t) = 
0, and so Qk{t) cannot increase on this slot. That is: 

Qkit + l)<Qk(t)<Viyk+xr'' 

□ 

B. Bound on Reputation Queues Hk (t) 

The Hk{t) processes act as reputation queues for each user 
k £ JC, being low if user k has a good reputation for helping 
others, and high otherwise (see queue dynamics in (fTSll). These 
reputations directly affect the transmission decisions via the 
weights Wnk{t) in (fTSl l. To see this, define A as the set of 
access points. First consider the weight seen by an access point 
a £ A for user k on slot t: 

Q kit) - akHkit) 

This weight is large if -fffe(t) is small (meaning user k has 
a good reputation). The next lemma shows that if utility 
functions have bounded right-derivatives Vk, then access points 
will refuse to send to any user if its Hkit) value exceeds a 
threshold. 

Lemma 3: If (j)kit) has maximum right-derivative i^k > 0, 
and if initial queue backlog satisfies < QkiO) < Vvk + 
rf-max^ then no access point will send to user fc on a given slot 
t if Hkit) > ^[Vuk + x^^% 

Proof: Lemma |2] ensures Qkit) < Vvk + x™"^ for all t. 
It follows that if Hkit) > -^[Vvk + a;"""^], then the weight 
seen by an access point a £ A for user k satisfies: 

Qkit) - akHkit) < Vvk+xr^" - akHkit) 
< 

Because the weight is negative, the max-weight functional (fTSl ) 
is maximized by choosing fiakit) = 0, so that access point a 
will not send data to user k on slot t. □ 
Now suppose user u £ JC considers transmitting to another 
user k £ K.. User u sees the weight: 

Qkit) + H^it) - akHkit) 

The value H^it) — akHkit) can be viewed as a differential 
reputation. We again see that a relatively low value of Hkit) 
improves the weights for user k. The next lemma shows that 



6 



all queues (t) are deterministically bounded. For simplicity, 
we state the lemma under the assumption that all initial queue 
backlogs are zero. 

Lemma 4: If all utility functions have right-derivatives 
bounded by finite constants Vk > 0, if (3k > for all fc e /C, 
and if initial backlog satisfies Qk{0) = -fffc(O) — for 
all k E K,, then there are finite constants Ci and C2, both 
independent of V, such that: 

\\@m<Ci + C2V yte {0,1,2,...} 

where ||0(i)|| is defined: 

II0WII < v^keKHk{tr + }:keKQk{ty 

The constants Ci and C2 are given in the proof. 

Proof: See Appendix. □ 

C. Constraint Satisfaction 

The deterministic queue bounds derived in the previous 
subsections ensure that the time average tit-for-tat constraints 
([8]l, as well as the auxiliary variable constraints are 
satisfied on every sample path, regardless of whether or not 
the topology state process uj{t) is ergodic. This is shown in 
the next lemma. Assume there are constants and i/™"^^ 

such that: 

0< Qfe(0 < Vie {0,1,2,...} (22) 

< Hk{t) < yte {0,1,2,...} (23) 

Recall that Lemmas |2] and |4] ensure this holds for Q™"^ — 
Fi^fe+a;™"^ and i/™'^^ — C1+C2F whenever queue backlogs 
are initially 0. 

Lemma 5: If (l22T i and ( |23] | hold, then for any slot t E 
{0, 1,2,.. .} and any positive integer T: 

(a) The tit-for-tat behavior over the interval t E {t, . . . ,t + 
T — 1} satisfies the following for all k E K,: 

f-i-T— 1 

- J2 K^fcM-A-yfeW] < (24) 

T = t 

(b) The auxiliary variable constraints satisfy for all k E K,: 

^ E hkir)-Xkir)]<^r- (25) 
Proof: From ( fTSl l we have for any slot r E {0, 1,2,.. .}: 
Hk{T + 1) > Hkir) + akXkir) - f3k - ykir) 
Summing the above over t E {t,t + 1, . . . ,t + T — 1} gives: 

t+T-l 

Hk{t + T)- Hk{t)> E [akXk{T) ~ Pk - yk{T)] 

T = t 

However: 

Hk{t + T)-Hk{t)<Wr'' 

Combining the above two inequalities and dividing by T yields 
( I24I 1. The inequality ( 1251 ) is proved similarly. □ 
Thus, even if the limits on the left-hand-sides of the in- 
equalities ( l24b and ( |25l l do not converge as T — cx) (possibly 
due to non-ergodic mobility), their lim sup must be less than 
or equal to 0. 



V. Extension to Finite File Sizes 

Now suppose the files requested by users have finite sizes. 
Suppose each user requests at most one file at a time. We say 
a user is in the active state if it is requesting a file, and is in 
the idle state if it does not have any file requests. For each 
k E K,, define Ak{t) to be 1 if user k is active on slot t, and 
else. If Ak{t) — 1, define J^kit) as the set of devices in Af 
that have the currently requested file of user k. Define J-k{t) 
to be the empty set {} if the user is not active on slot t. Define 
Dk{t) as the number of additional required packets for user 
k to complete its file request (where Dk{t) > if and only 
if Ak{t) — 1). When an active user finishes downloading all 
packets of its requested file on some slot t, it goes to the idle 
state, so that Ak{t + 1) = Dk{t + 1) = 0, and Fk{t + 1) = {}. 

We can naturally extend the algorithm developed in the 
previous section to this case (although in this more general 
case the algorithm is a heuristic). The only change is that Fb 
in ([T]i is changed to Fb{t). The algorithm proceeds exactly as 
before, still updating queues and making all 7fc(t) and jiabit) 
decisions the same way for all users on every slot, regardless 
of whether users are active or idle. Of course, the fab{t) 
parameters in (fTSI l and in the receive and send equations (|2]i, 
(|3]l will remove any transmission link (a, b) from consideration 
if user h is idle. However, idle users can still participate in data 
delivery to other users. If one wants to model a user k that 
wishes to turn its peer-delivery functionality "off," one can use 
a modified set of transmission options that sets all outgoing 
links from user k to have rate 0. Intuitively, the algorithm will 
behave well, with performance close to that suggested by the 
infinite file size assumption, when file sizes are large. 

VI. Simulation 

We simulate the algorithm on the cell-partitioned network 
structure of Section III-AI with if = 50 users and a single 
base station as access point. The users move according to a 
Markov random walk on a 4 x 4 grid with 16 subcells. We 
use utility functions 4>k{x) given by ( |20] | with I'k = 1- Each 
cell can support at most one user-to-user packet transmission 
per slot. The base station can transmit to at most one user 
k per slot, with transmission rate Sk{t) that is independent 
over slots and across users with Pr[Sk{t) = 0] = Pr[Sk{t) = 
1] = Pr[Skit) = 2] = 1/3. We simulate over 10*^ slots. 
On slot t = 0, we assign each user k a desired file that is 
independently in the other users with probability p = 0.05, 
which establishes the J-k sets. These sets are held fixed for 
the first third of the simulation, and then independently drawn 
again with a larger probability p = 0.1 and held fixed for the 
second third. New files are again drawn at the beginning of 
the final third of the simulation, with p = 0.07. 

Fig. [U plots the resulting throughput components from the 
base station traffic and peer-to-peer traffic separately, using 
V =10, ak^a = 0.5, Pk = 0.05, x^"^ = 3. Even though 
there are only an average of 50/16 = 3.125 users per cell, and 
in the first third of the simulation there is only a 5% chance 
that a given user has the file desired by another user, the peer- 
to-peer traffic is still more than twice that of the base station 
alone. This further increases in the middle of the simulation 
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when the file availability probability jumps to 10%. Fig. |2] 
shows that the value of Qk{t) never exceeds 10 packets for 
any user k (recall that the worst-case guarantee from Lemma 
|2]is (3fc(t) < F + 3 = 13 packets). All tit-for-tat constraints 
were satisfied, with Hk{t) < 24.6 for all fc € /C and all t. 

Figs. |3] and |4] explore the throughput-backlog tradeoff with 
V (one can also plot the throughput-utility with V to see 
a similar convergence as in Fig. [3]l. Fig. |3] also treats the 
case when the tit-for-tat constraint is made more stringent 
(a — 0.75), in which case throughput is reduced. There was 
no observed change in average queue backlog for a = 0.5 and 
a = 0.75 (the simulated curves look like one curve in Fig. |4] 
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Fig. 1. Average throughput per-user versus time. 
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Fig. 2. Average and worst-case queue backlog per user versus time. 
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Fig. 3. Throughput versus V for a = 0.5 and a = 0.75. 
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Fig. 4. Queue backlog versus V, demonstrating the 0{V) behavior. The 
results for a = 0.5 and a = 0.75 are right on top of each other. The upper 
bound V + 3 is also plotted. 



VII. Conclusions 

This paper develops a simple peer-to-peer scheduling algo- 
rithm for a mobile wireless network. Users can opportunisti- 
cally grab packets from their peers who already have the de- 
sired file in their cache. This capability significantly increases 
the throughput capabilities in the network in comparison to 
a network where users can only download data from a base 
station. Our simulations demonstrate these gains, and illustrate 
that the algorithm is robust to non-ergodic events. Analytically, 
we have shown that if each user requires a single file with in- 
finite length, then the algorithm can achieve throughput utility 
that is arbitrarily close to optimality. Distance to optimality 
depends on a V parameter that also affects an 0{V) tradeoff 
in virtual queue sizes. Our model embeds tit-for-tat constraints 
directly into the stochastic network optimization problem. This 
incentivizes participation and naturally leads to an algorithm 
with reputation queues. Peer-to-peer links between users are 
favored according to the differential reputation of the users. 
Finally, while our results were developed for wireless scenar- 
ios, we note that the algorithm is general and the techniques 
can be used in other contexts, such as in wireline computer 
networks and mixed wireless and wireline networks. 

Appendix — Proof of Lemma|4] 

Here we prove Lemma |4l We state the lemma again for 
convenience: 

Lemma: If all utility functions have right-derivatives 
bounded by finite constants i^k > 0, if Pk > for all k £ K,, 
and if initial backlog satisfies Qfc(O) = Hk{0) = for 
all k E K,, then there are finite constants Ci and C2, both 
independent of V, such that: 

\\&m < + C2V Vte {0,1,2,...} 

where ||0(t)|| is defined: 



\em< Y^Hkity 



keK 



keK 



Qkity 



The constants Ci and C2 are explicitly computed in (|30] l 
and dSTT i below, in terms of the B constant from Lemma [T] 
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Proof: Because our algorithm makes decisions for {^kb{t)) 
and 7fc(t) that minimize the last three terms in the right- 
hand-side of ( [TtI i over all alternative feasible decisions, they 
give a value that is less than or equal to the value when 
the corresponding terms use the alternative feasible decisions 
7fc(*) = f^kb{t) = 0. Thus: 



where: 



-Y,Ht{t)fik 



keK 



keJC 

mm 

where B is the constant that upper bounds B{t) from 
Lemma [T] for all t, /3mm= minfeg/c /^fe. and we have 
used the fact that J^kefc^^it) > Now define 

Co=EkeK.iMxr") - MO)]. We have for all t: 

A{t) < B + VCo - l3r,-un\\Hit)\\ 

Note that: 

A(t) = i||0(i + l)|p-i||0(Oir 



Thus: 

\\&{t 



I |0(i)| r < 2(B + VCo) - 2l3rmn\\Hit) 1 1 

(26) 

Note by Lemma |2] that for all slots t, we have < Qk{t) < 
Qmax, where: 

where v^ax^vaaxkeKVk and x„iax^ 
for any slot t: 

mm < wQim + mm 

< Q max 
Now suppose that on slot t, we have: 

B + VCo 



Thus, 



(27) 



|0(t)||> 



(in 



Combining this with dZTl i shows that if 

B + VCo 



K 

holds, then: 



(28) 



\Hm> 



(29) 



It follows that if (l28]l holds, then (by combining (l26]l and (|29]l): 

\m + i)f-\\@{t)f <o 

Thus, ||0(t)|| cannot increase if (l28T l holds on slot t. Because 
the initial queue backlog is less than the threshold given in 
it follows that for all t: 

B + VCo 



m)\\< 



K 
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Pmin 

where g is defined as the maximum possible increase in 
||0(<:)|| in one slot. Because both Qk{t) and Hk{t) can 
increase by at most Xmax in one slot, we have g < XmaxV^K. 
Thus, for all slots t we have: 

liewil < ^ + 



< 




K 



B/I3r, 



rr 




(30) 
(31) 

□ 
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